Event-triggered video creation with data augmentation

ABSTRACT

A method for creating a video that is generated based on the occurrence of pertinent events within a period of time. This video may be a summary video that includes video segments from multiple sources. The video may be augmented to display data describing pertinent events that occur.

BACKGROUND

In a world of ever increasing smart devices and smarter technologies,the internet of things (IoT) proposes that everyday objects and devices(e.g., light bulbs, washing machines and cameras) are to have aninternet connection, where these objects are enabled to send or receivedata to and from other devices, thus forming a larger network ofconnected things (e.g., objects or devices). The IoT promises smarterhomes and work environments where all the objects are seamlesslyconnected and controlled for easier and more convenient operation.

This disclosure is directed to addressing issues in the existingtechnology. This background information is provided to revealinformation believed by the applicant to be of possible relevance. Noadmission is necessarily intended, nor should be construed, that any ofthe preceding information constitutes prior art.

SUMMARY

Disclosed herein are methods, systems, and apparatuses that may be usedto create a video, in which the video is generated based on theoccurrence of pertinent events within a period of time. This video maybe a summary video that includes video segments from multiple sources.The video may be augmented to display data describing other pertinentevents that occur during the same period of time.

In an example, an apparatus may include a processor and a memory coupledwith the processor that effectuates operations. The operations mayinclude obtaining video of a monitored location; obtaining informationindicative of a device triggering event associated with a device,wherein the device is located in the monitored location; obtaininginformation indicative of a video triggering event associated with thevideo; based on the information indicative of the device triggeringevent or the video triggering event, determining descriptive event textfor a first period associated with the video; and creating a first videosegment associated with the descriptive event text for the first periodbased on the device triggering event or the video triggering event.

This Summary is provided to introduce a selection of concepts in asimplified form that are further described below in the DetailedDescription. This Summary is not intended to identify key features oressential features of the claimed subject matter, nor is it intended tobe used to limit the scope of the claimed subject matter. Furthermore,the claimed subject matter is not limited to limitations that solve anyor all disadvantages noted in any part of this disclosure.

BRIEF DESCRIPTION OF THE DRAWINGS

Reference will now be made to the accompanying drawings, which are notnecessarily drawn to scale.

FIG. 1 illustrates an exemplary system associated with event-triggeredvideo creation with data augmentation.

FIG. 2 illustrates an exemplary method associated with event-triggeredvideo creation with data augmentation.

FIG. 3 illustrates an exemplary event-triggered video creation with dataaugmentation.

FIG. 4 illustrates a schematic of an exemplary network device.

FIG. 5 illustrates an exemplary communication system that provideswireless telecommunication services over wireless communicationnetworks.

DETAILED DESCRIPTION

Disclosed herein are methods, systems, and apparatuses that may be usedto create a video, in which the video is generated based on theoccurrence of pertinent events within a period of time. This video maybe a summary video that includes video segments from multiple sources.The video may be augmented to display data describing other pertinentevents that occur during the same period of time. Exemplary scenariosfor event-triggered video creation with data augmentation are disclosedin more detail below, such a remote monitoring of a person in a seniorliving facility.

FIG. 1 illustrates an exemplary system associated with event-triggeredvideo creation with data augmentation. System 100 may include multipledevices that may be remote from home 116 (e.g., a house), such as videotimeline (VT) server 101 or mobile device 102 associated with user 103(e.g., monitoring party 103), among other things. System 100 may includemultiple devices that may be local to home 116, such as mobile device112 associated with user 111 (e.g., monitored party 111), smart speaker113, exterior sensor 114, interior sensor 115, router 118, or camera117, among other things. Home 116, which may be considered a domain ormonitored location, may include objects that are primarily notelectronic, such as furniture. Domain may be narrowed to home-kitchen,home-bedroom-camera 117, or the like. The devices of system 100 may becommunicatively connected with each other (e.g., via wireless or wiredconnections using network 104 or peer-to-peer (P2P) communication).

With continued reference to FIG. 1, in an example, VT server 101, mobiledevice 102, and the devices of home 116 may be communicatively connectedwith each other in order to obtain or provide information associatedwith monitored party 111, which may be a user of mobile device 112. VTserver 101 may obtain information from any of the devices of system 100,which may be associated with home 116 or user of mobile device 112, tohelp manage the disclosed event-triggered video creation with dataaugmentation that is associated with home 116. Mobile device 112 may beused to monitor user 111 and gather or disseminate information such asuser profile information, health information, user movement information,or the like. It is contemplated herein that the functions of the devicesof system 100 may be combined into one device or distributed overmultiple devices.

Mobile device 102 or mobile device 112 may include, wireless devices,such as satellite communication systems, portable digital assistants(PDAs), laptop computers, tablet devices, smart phones, smart watches,smart speakers, automobiles (e.g., autonomous vehicles), augmentedreality devices, virtual reality devices, virtual assistants, or thelike. Exterior sensor 114 or interior sensor 115 may include anenvironmental sensor, acoustic sensor, sound sensor, vibration sensor,fluid sensor, optical sensor, position sensor (e.g., accelerometer orgyroscope), speed sensor, chemical sensor, pressure sensor, camera, orthe like.

FIG. 2 illustrates an exemplary method associated with event-triggeredvideo creation with data augmentation. At step 121, a monitored location(e.g., domain) may be defined, such as home 116. This may or may notinclude monitored party 111. In one scenario, monitored party 111 may bean elderly person living in home 116. Home 116 may include one or moredevices that may detect events. This may include motion sensors,environmental sensors, microphones, health monitors, virtual assistants,doorbells, appliances, electronic pill boxes, environmental controls,cameras, home alarm systems, or smartphones, among other devices, all ofwhich may be communicatively connected with router 118, each other, ornetwork 104 (e.g., the Internet). If monitored part 111 is defined,monitored party 111 may be identified through facial recognition, voicerecognition (e.g., when speaking to a smart speaker), or use of mobiledevice 112, among other things.

At step 122, video of home 116 may be recorded by camera 117. This videomay be recorded based on a trigger (e.g., motion sensor) or camera 117may continually record. The triggers disclosed herein may be associatedwith the use of one or more of the devices, e.g., motion, use of pillbox, use of doorbell, a microphone that detects particular sounds oraudible words (e.g., “help,” a person's name, or smart speaker wake-upword), a particular decibel level of sound, or other sensors that detectpredetermined conditions. In an example, in addition, triggers may beassociated with a person's gait (e.g., detect change in gait), a personfalling, or threshold inactivity of a detected person, among otherthings. The video may be obtained by VT server 101.

At step 123, a triggering event (device or video) may be detected and atriggering event log may be created. A device triggering event may bebased on information obtained from one or more devices of system 100. Avideo triggering event may be based on information determined afteranalysis of the video. The triggering event may be the same as thetriggers as disclosed in step 122. A triggering event may be a detectedmotion, a doorbell pressed or ringing, thermostat set, an objectrecognized (e.g., rectangular box at door), a package alert received(e.g., via SMS text or smart speaker), a face recognized (e.g., aperson), or a door opened (e.g., detected via an alarm system sensor),among other things. These triggering events may be logged and obtainedby VT server 101.

At step 124, based on the device triggering event or the videotriggering event, descriptive event text may be determined. Descriptiveevent text may be considered a more detailed description of a period oftime. The descriptive event text may be text displayed on or near asubsequently created video. VT server 101 may analyze triggering eventlogs and analyze video during a period to determine the descriptiveevent text. In a first example, the device triggering event logs duringa first period may include an electronic calendar appointment for anin-home doctor's visit, a pressed doorbell, and a door opened andclosed. In addition, the video triggering event logs during the firstperiod may include an indication that a person that looks like a doctoror the audio from the video may indicate the word “doctor” was spoken.In this first example, VT server 101 may determine, based on theanalyzed triggering event logs and the video during the first period,that a doctor visit occurred during the first period. VT server 101 maydescribe the first period of the video as “Doctor Visit” instead of orin addition to indicating a “doorbell rang” and “conversation wasdetected.” It is contemplated that video from different cameras may becombined and synched for the first video segment. Using multipletriggering events in determining a descriptive event may increase theconfidence level of the accuracy of the determined descriptive eventtext. The confidence level may be sent to mobile device 102 and theremay be a threshold confidence that is required to be reached in order tocreate the video segment for the descriptive event text. It is alsocontemplated that a descriptive event image may be selected. Forexample, instead of the text “Doctor Visit” there may be an image (e.g.,icon) of a stethoscope.

At step 125, a summary video of a descriptive event (e.g., “DoctorVisit”—first video segment) for the first period may be created ordemarcated over the first period. This first video segment, as describedin more detail below, may be part of a larger period that may be asummary of an extended time frame (e.g., events during a work day, anentire day, an entire week, etc.).

At step 126, VT server 101 may determine whether to truncate the firstvideo segment. The first video segment may be truncated based on apredetermined set video segment length. For example, the first periodmay be 30 minutes and the predetermined set maximum video segment lengthmay be 5 minutes. In this example, the first video segment should betruncated.

At step 127, based on a determination that the first video segmentshould be truncated, truncating the first video segment. There aremultiple ways that the first video segment may be automaticallytruncated. In an example in which the segment length is reduced from 30minutes to 5 minutes, the first and last 2 minutes and 30 seconds of thefirst video segment may be used to create a second video segment of 5minutes total. In another example, short periods (e.g., 5 seconds or 30seconds each) after or before discrete triggering events of the firstvideo segment may be combined to create the truncated second segment.For example, the triggering events may be a pressed doorbell, dooropening, door closing, entering a first room, entering a second room, orkeywords spoken, among other things. The length of the “short periods”(sub-segments of the first video) may be based on the number oftriggering events of the first video. For example, 10 triggering eventswith a maximum of a 100-second video may be calculated to ten 10-secvideos. In another example, the first video segment may be truncatedbased on inactivity. For example, there may be an observation of littleto no movement and therefore redundant frames may be removed. Reductionin time of inactivity or other periods may be achieved by VT server 101setting playback of a video segment (e.g. a sub-segment of the secondvideo segment) to be at a faster speed. It is contemplated that anyoverlap in sub-segments may be addressed in a manner that there islittle to no repeat of the same video.

At step 128, integrating the truncated second segment into a summaryvideo of an extended time frame (e.g., highlight reel) for the morningor entire day, as described in more detail herein. At step 129, theentire summary video may be provided to monitoring party 103 via mobiledevice 102. Alternatively, the truncated second segment (or the firstsegment) may be provided to monitoring party 103 via mobile device 102,particularly when abnormal behavior or an emergency situation isdetected. It is also contemplated herein that many of the steps of FIG.2 may apply to the summary video.

FIG. 3 illustrates an exemplary event-triggered video creation with dataaugmentation. As shown in FIG. 3, there are several events andassociated videos that may be logged. Block 131 displays an exemplarytimeline of triggering events that may be logged, such as presseddoorbell, door opening, or conversation detected. At or near the sametime of the triggering events, video may be captured from one or morecameras. As shown, the video of block 141-block 147 may be associatedwith the triggering events on the timeline of block 131. Block 151 maybe a descriptive event which includes a summary video of block 141-block145 video that may be truncated based on predetermined information(e.g., descriptive event videos may only be 5 minutes long or less).Block 152 may be another descriptive event video of block 146 and block147 and may be truncated (e.g., 1 minute) as well to remove inactivity.Block 151 and block 152, among other blocks (not shown) may be combinedto create a summary video at block 153 that is sent to mobile device 102(e.g., “morning summary video”), which may be 1 minute long in thisexample.

Below is additional perspective, details, and scenarios associated withevent-triggered video creation with data augmentation. When triggeringevents are detected, an indication of the triggering event may be sent,indicating the type of event, and the time and date of the event, to anevents database (e.g., VT server 101), where they are logged along witha location ID, which is a unique identifier of the monitored location(e.g., home 116). Camera 117 may also have built-in logic that triggersactivation of recording and the creation of video segments when thevideo is analyzed for context by software resident on camera 117. Forinstance, camera 117 may monitor for the detection of changes in aperson's gait, a fall, or inactivity.

Video segments may be recorded and sent to a video database (e.g., VTserver 101). These video segments may include the time and date stampsand a description of the trigger that caused the creation of thesegment. For example, the triggering event may be motion detected,inactivity detected, fall detected, trip detected, audio detected, orothers. In an example, a virtual assistant (e.g., smart speaker 113) maylisten for triggering sound types or spoken words, such as a scream or aspoken “help.”

VT Server 101 may combine (e.g., concatenate) the pertinent videosegments together into a single video stream (e.g., summary video).Within the stream, a box or other display may be used to indicatetriggering events or descriptive events that occurred during the time ofthe video. The timestamps of the triggering events may be used todetermine when to insert the event displays as an overlay on the video.For instance, if a doctor visit event occurred at 9:55 am, then an eventalert may be displayed at the 9:55 am point within a 6:00 am-11:59 amvideo segment.

Some triggering events may occur at a time when no video was produced.In this case, a substitute video segment (e.g., a screen with a pictureof a thermostat or text) may be inserted by VT server 101 in itscreation of the summary video stream. This may be a short segment (e.g.,10 seconds) that permits time for the event text or still image todisplay. For instance, monitored party 111 may have set the thermostatto a low reading that is out of their comfort range. No video wastriggered for the event, but the substitute video segment permits theevent display. The result of the creation of the data augmented videostream (e.g., summary video) may be considered a “highlight” reel of themost pertinent video, augmented with time-synchronized event displays.

VT Server 101 may compress (as with regard to playback time or memory)some videos to permit the video stream to be shorter in duration. Forinstance, video segments that are triggered by inactivity detection maybe compressed in time by removing redundant frames. In another example,inactivity video segments may be achieved by VT Server 101 settingplayback for that segment to be at a faster speed (e.g., 2 x or 10 xplayback speed).

Monitoring party 103 may be alerted of the availability of “highlight”reels via an alert sent to mobile device 102. Alternatively, monitoringparty 103 may send a request to VT server 101 for playback. Some typesof triggering events may be set to be “critical,” in which a segment ofthe summary video is highlighted, a text indication of a possibleemergency is provided, or some other alert to indicate to monitoringparty 103 that the video should be reviewed as soon as possible. Forinstance, “critical” events (e.g., triggering events or descriptiveevents) may include missed pill dosage, garage door left open, pacemakermonitor alert, possible leg injury (e.g., based on same person differentgait), incorrect pills taken (e.g., incorrect pill box opened), orabnormal blood flow (e.g., through smartwatch or transdermal opticalimaging). Critical events may include trip detected, fall detected, andinactivity detected. Critical alerts may be noted on a listing ofavailable highlight videos, which may be displayed on mobile device 102.

FIG. 4 is a block diagram of network device 300 that may be connected toor comprise a component of system 100. Network device 300 may comprisehardware or a combination of hardware and software. The functionality tofacilitate telecommunications via a telecommunications network mayreside in one or combination of network devices 300. Network device 300depicted in FIG. 4 may represent or perform functionality of anappropriate network device 300, or combination of network devices 300,such as, for example, a component or various components of a cellularbroadcast system wireless network, a processor, a server, a gateway, anode, a mobile switching center (MSC), a short message service center(SMSC), an automatic location function server (ALFS), a gateway mobilelocation center (GMLC), a radio access network (RAN), a serving mobilelocation center (SMLC), or the like, or any appropriate combinationthereof. It is emphasized that the block diagram depicted in FIG. 4 isexemplary and not intended to imply a limitation to a specificimplementation or configuration. Thus, network device 300 may beimplemented in a single device or multiple devices (e.g., single serveror multiple servers, single gateway or multiple gateways, singlecontroller or multiple controllers). Multiple network entities may bedistributed or centrally located. Multiple network entities maycommunicate wirelessly, via hard wire, or any appropriate combinationthereof.

Network device 300 may comprise a processor 302 and a memory 304 coupledto processor 302. Memory 304 may contain executable instructions that,when executed by processor 302, cause processor 302 to effectuateoperations associated with mapping wireless signal strength. As evidentfrom the description herein, network device 300 is not to be construedas software per se.

In addition to processor 302 and memory 304, network device 300 mayinclude an input/output system 306. Processor 302, memory 304, andinput/output system 306 may be coupled together (coupling not shown inFIG. 4) to allow communications between them. Each portion of networkdevice 300 may comprise circuitry for performing functions associatedwith each respective portion. Thus, each portion may comprise hardware,or a combination of hardware and software. Accordingly, each portion ofnetwork device 300 is not to be construed as software per se.Input/output system 306 may be capable of receiving or providinginformation from or to a communications device or other network entitiesconfigured for telecommunications. For example, input/output system 306may include a wireless communications (e.g., 3G/4G/GPS) card.Input/output system 306 may be capable of receiving or sending videoinformation, audio information, control information, image information,data, or any combination thereof. Input/output system 306 may be capableof transferring information with network device 300. In variousconfigurations, input/output system 306 may receive or provideinformation via any appropriate means, such as, for example, opticalmeans (e.g., infrared), electromagnetic means (e.g., RF, Wi-Fi,Bluetooth®, ZigBee®), acoustic means (e.g., speaker, microphone,ultrasonic receiver, ultrasonic transmitter), or a combination thereof.In an example configuration, input/output system 306 may comprise aWi-Fi finder, a two-way GPS chipset or equivalent, or the like, or acombination thereof.

Input/output system 306 of network device 300 also may contain acommunication connection 308 that allows network device 300 tocommunicate with other devices, network entities, or the like.Communication connection 308 may comprise communication media.Communication media typically embody computer-readable instructions,data structures, program modules or other data in a modulated datasignal such as a carrier wave or other transport mechanism and includesany information delivery media. By way of example, and not limitation,communication media may include wired media such as a wired network ordirect-wired connection, or wireless media such as acoustic, RF,infrared, or other wireless media. The term computer-readable media asused herein includes both storage media and communication media.Input/output system 306 also may include an input device 310 such askeyboard, mouse, pen, voice input device, or touch input device.Input/output system 306 may also include an output device 312, such as adisplay, speakers, or a printer.

Processor 302 may be capable of performing functions associated withtelecommunications, such as functions for processing broadcast messages,as described herein. For example, processor 302 may be capable of, inconjunction with any other portion of network device 300, determining atype of broadcast message and acting according to the broadcast messagetype or content, as described herein.

Memory 304 of network device 300 may comprise a storage medium having aconcrete, tangible, physical structure. As is known, a signal does nothave a concrete, tangible, physical structure. Memory 304, as well asany computer-readable storage medium described herein, is not to beconstrued as a signal. Memory 304, as well as any computer-readablestorage medium described herein, is not to be construed as a transientsignal. Memory 304, as well as any computer-readable storage mediumdescribed herein, is not to be construed as a propagating signal. Memory304, as well as any computer-readable storage medium described herein,is to be construed as an article of manufacture.

Memory 304 may store any information utilized in conjunction withtelecommunications. Depending upon the exact configuration or type ofprocessor, memory 304 may include a volatile storage 314 (such as sometypes of RAM), a nonvolatile storage 316 (such as ROM, flash memory), ora combination thereof. Memory 304 may include additional storage (e.g.,a removable storage 318 or a non-removable storage 320) including, forexample, tape, flash memory, smart cards, CD-ROM, DVD, or other opticalstorage, magnetic cassettes, magnetic tape, magnetic disk storage orother magnetic storage devices, USB-compatible memory, or any othermedium that can be used to store information and that can be accessed bynetwork device 300. Memory 304 may comprise executable instructionsthat, when executed by processor 302, cause processor 302 to effectuateoperations to map signal strengths in an area of interest.

FIG. 5 depicts an exemplary diagrammatic representation of a machine inthe form of a computer system 500 within which a set of instructions,when executed, may cause the machine to perform any one or more of themethods described above. One or more instances of the machine canoperate, for example, as processor 302, smart speaker 113, exteriorsensor 114, interior sensor 115, router 118, or camera 117 and otherdevices of FIG. 1. It is contemplated that listed devices may be remotefrom or local to home 116 in certain situations. In some examples, themachine may be connected (e.g., using a network 502) to other machines.In a networked deployment, the machine may operate in the capacity of aserver or a client user machine in a server-client user networkenvironment, or as a peer machine in a peer-to-peer (or distributed)network environment.

The machine may comprise a server computer, a client user computer, apersonal computer (PC), a tablet, a smart phone, a laptop computer, adesktop computer, a control system, a network router, switch or bridge,or any machine capable of executing a set of instructions (sequential orotherwise) that specify actions to be taken by that machine. It will beunderstood that a communication device of the subject disclosureincludes broadly any electronic device that provides voice, video ordata communication. Further, while a single machine is illustrated, theterm “machine” shall also be taken to include any collection of machinesthat individually or jointly execute a set (or multiple sets) ofinstructions to perform any one or more of the methods discussed herein.

Computer system 500 may include a processor (or controller) 504 (e.g., acentral processing unit (CPU)), a graphics processing unit (GPU, orboth), a main memory 506 and a static memory 508, which communicate witheach other via a bus 510. The computer system 500 may further include adisplay unit 512 (e.g., a liquid crystal display (LCD), a flat panel, ora solid state display). Computer system 500 may include an input device514 (e.g., a keyboard), a cursor control device 516 (e.g., a mouse), adisk drive unit 518, a signal generation device 520 (e.g., a speaker orremote control) and a network interface device 522. In distributedenvironments, the examples described in the subject disclosure can beadapted to utilize multiple display units 512 controlled by two or morecomputer systems 500. In this configuration, presentations described bythe subject disclosure may in part be shown in a first of display units512, while the remaining portion is presented in a second of displayunits 512.

The disk drive unit 518 may include a tangible computer-readable storagemedium 524 on which is stored one or more sets of instructions (e.g.,software 526) embodying any one or more of the methods or functionsdescribed herein, including those methods illustrated above.Instructions 526 may also reside, completely or at least partially,within main memory 506, static memory 508, or within processor 504during execution thereof by the computer system 500. Main memory 506 andprocessor 504 also may constitute tangible computer-readable storagemedia.

As described herein, a telecommunications system wherein management andcontrol utilizing a software defined network (SDN) and a simple IP arebased, at least in part, on user equipment, may provide a wirelessmanagement and control framework that enables common wireless managementand control, such as mobility management, radio resource management,QoS, load balancing, etc., across many wireless technologies, e.g. LTE,Wi-Fi, and future 5G access technologies; decoupling the mobilitycontrol from data planes to let them evolve and scale independently;reducing network state maintained in the network based on user equipmenttypes to reduce network cost and allow massive scale; shortening cycletime and improving network upgradability; flexibility in creatingend-to-end services based on types of user equipment and applications,thus improve customer experience; or improving user equipment powerefficiency and battery life—especially for simple M2M devices—throughenhanced wireless management.

While examples of a telecommunications system in which event-triggeredvideo creation with data augmentation can be processed and managed havebeen described in connection with various computing devices/processors,the underlying concepts may be applied to any computing device,processor, or system capable of facilitating a telecommunicationssystem. The various techniques described herein may be implemented inconnection with hardware or software or, where appropriate, with acombination of both. Thus, the methods and devices may take the form ofprogram code (i.e., instructions) embodied in concrete, tangible,storage media having a concrete, tangible, physical structure. Examplesof tangible storage media include floppy diskettes, CD-ROMs, DVDs, harddrives, or any other tangible machine-readable storage medium(computer-readable storage medium). Thus, a computer-readable storagemedium is not a signal. A computer-readable storage medium is not atransient signal. Further, a computer-readable storage medium is not apropagating signal. A computer-readable storage medium as describedherein is an article of manufacture. When the program code is loadedinto and executed by a machine, such as a computer, the machine becomesa device for telecommunications. In the case of program code executionon programmable computers, the computing device will generally include aprocessor, a storage medium readable by the processor (includingvolatile or nonvolatile memory or storage elements), at least one inputdevice, and at least one output device. The program(s) can beimplemented in assembly or machine language, if desired. The languagecan be a compiled or interpreted language, and may be combined withhardware implementations.

The methods and devices associated with a telecommunications system asdescribed herein also may be practiced via communications embodied inthe form of program code that is transmitted over some transmissionmedium, such as over electrical wiring or cabling, through fiber optics,or via any other form of transmission, wherein, when the program code isreceived and loaded into and executed by a machine, such as an EPROM, agate array, a programmable logic device (PLD), a client computer, or thelike, the machine becomes an device for implementing telecommunicationsas described herein. When implemented on a general-purpose processor,the program code combines with the processor to provide a unique devicethat operates to invoke the functionality of a telecommunicationssystem.

While a telecommunications system has been described in connection withthe various examples of the various figures, it is to be understood thatother similar implementations may be used or modifications and additionsmay be made to the described examples of a telecommunications systemwithout deviating therefrom. For example, one skilled in the art willrecognize that a telecommunications system as described in the instantapplication may apply to any environment, whether wired or wireless, andmay be applied to any number of such devices connected via acommunications network and interacting across the network. Therefore, atelecommunications system as described herein should not be limited toany single example, but rather should be construed in breadth and scopein accordance with the appended claims.

In describing preferred methods, systems, or apparatuses of the subjectmatter of the present disclosure—event-triggered video creation withdata augmentation—as illustrated in the Figures, specific terminology isemployed for the sake of clarity. The claimed subject matter, however,is not intended to be limited to the specific terminology so selected.In addition, the use of the word “or” is generally used inclusivelyunless otherwise provided herein.

This written description uses examples to enable any person skilled inthe art to practice the claimed subject matter, including making andusing any devices or systems and performing any incorporated methods.Other variations of the examples are contemplated herein.

Methods, systems, and apparatuses, among other things, as describedherein may provide for obtaining video of a monitored location;obtaining information indicative of a device triggering event associatedwith a device, wherein the device is located in the monitored location;obtaining information indicative of a video triggering event associatedwith the video; based on the information indicative of the devicetriggering event or the video triggering event, determining descriptiveevent text for a first period associated with the video; and creating afirst video segment associated with the descriptive event text for thefirst period based on the device triggering event and the videotriggering event. The method, system, computer readable storage medium,or apparatus may provide for based on the time length of the first videosegment, preparing a truncated first video segment of the first videosegment with the descriptive event text. The method, system, computerreadable storage medium, or apparatus may provide for integrating thetruncated first video segment into a summary video segment. The method,system, computer readable storage medium, or apparatus may provide fordisplaying an indication of the summary video segment to a mobiledevice. The device may include a mobile device or a smart speaker. Theinformation indicative of the video triggering event or the devicetriggering event may include a threshold decibel level of a spoken wordor other audio. All combinations in this paragraph (including theremoval or addition of steps) are contemplated in a manner that isconsistent with the other portions of the detailed description.

What is claimed:
 1. An apparatus comprising: a processor; and a memorycoupled with the processor, the memory storing executable instructionsthat when executed by the processor cause the processor to effectuateoperations comprising: obtaining video of a monitored location;obtaining information indicative of a device triggering event associatedwith a device, wherein the device is located in the monitored location;obtaining information indicative of a video triggering event associatedwith the video; based on the information indicative of the devicetriggering event and the video triggering event, determining descriptiveevent text for a first period associated with the video; creating afirst video segment of the video, wherein the first video segment isassociated with the descriptive event text for the first period based onthe device triggering event and the video triggering event; determininga time length of the first video segment; based on the time length ofthe first video segment, preparing a truncated first video segment ofthe first video segment with the descriptive event text; and integratingthe truncated first video segment into a summary video segment.
 2. Theapparatus of claim 1, the operations further comprising based on timelength of the first video segment, preparing a truncated first videosegment of the first video segment with the descriptive event text. 3.The apparatus of claim 1, the operations further comprising displayingan indication of the summary video segment to a mobile device.
 4. Theapparatus of claim 1, wherein the device comprises a mobile device. 5.The apparatus of claim 1, wherein the device comprises a smart speaker.6. The apparatus of claim 1, wherein the information indicative of thevideo triggering event or the device triggering event comprises athreshold decibel level of a spoken word.
 7. A system comprising: amobile device; and apparatus communicatively connected with the mobiledevice, the apparatus comprising: a processor; and a memory coupled withthe processor, the memory storing executable instructions that whenexecuted by the processor cause the processor to effectuate operationscomprising: obtaining video of a monitored location; obtaininginformation indicative of a device triggering event associated with adevice, wherein the device is located in the monitored location;obtaining information indicative of a video triggering event associatedwith the video; based on the information indicative of the devicetriggering event and the video triggering event, determining descriptiveevent text for a first period associated with the video; creating afirst video segment of the video, wherein the first video segment isassociated with the descriptive event text for the first period based onthe device triggering event and the video triggering event; determininga time length of the first video segment; based on the time length ofthe first video segment, preparing a truncated first video segment ofthe first video segment with the descriptive event text; and integratingthe truncated first video segment into a summary video segment.
 8. Thesystem of claim 7, the operations further comprising based on timelength of the first video segment, preparing a truncated first videosegment of the first video segment with the descriptive event text. 9.The system of claim 7, the operations further comprising displaying anindication of the summary video segment to the mobile device.
 10. Thesystem of claim 7, wherein the device comprises a sensor.
 11. The systemof claim 7, wherein the device comprises a smart speaker.
 12. The systemof claim 7, wherein the information indicative of the video triggeringevent or the device triggering event comprises a threshold decibel levelof a spoken word.
 13. A computer readable storage medium storingcomputer executable instructions that when executed by a computingdevice cause said computing device to effectuate operations comprising:obtaining video of a monitored location; obtaining informationindicative of a device triggering event associated with a device,wherein the device is located in the monitored location; obtaininginformation indicative of a video triggering event associated with thevideo; based on the information indicative of the device triggeringevent and the video triggering event, determining descriptive event textfor a first period associated with the video; creating a first videosegment of the video, wherein the first video segment is associated withthe descriptive event text for the first period based on the devicetriggering event and the video triggering event; determining a timelength of the first video segment; based on the time length of the firstvideo segment, preparing a truncated first video segment of the firstvideo segment with the descriptive event text; and integrating thetruncated first video segment into a summary video segment.
 14. Thecomputer readable storage medium of claim 13, the operations furthercomprising based on time length of the first video segment, preparing atruncated first video segment of the first video segment with thedescriptive event text.
 15. The computer readable storage medium ofclaim 13, wherein the device comprises a sensor.
 16. The computerreadable storage medium of claim 13, wherein the device comprises asmart speaker.
 17. The computer readable storage medium of claim 13,wherein the information indicative of the video triggering event or thedevice triggering event comprises a threshold decibel level of a spokenword.